image and video
Elon Musk's Alternate Grok Reality
Amid a scandal over nonconsensual sexual images, Musk says his AI chatbot is a force for "truth and beauty." In much of the world, Grok and its parent company both appear to be in serious trouble. After Grok, X's AI chatbot, was used to generate sexualized and violent images of women and children, the social media company has faced a wave of backlash and censure, with new nationwide bans on accessing Grok in place and other consequences on the way. On Monday, the EU threatened to fine X under its broad Digital Services Act if it didn't act "quickly" to fix Grok, in the words of one regulator.
- Europe > United Kingdom (0.16)
- Asia > Malaysia (0.15)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.05)
- Asia > Indonesia (0.05)
- Media (1.00)
- Law (1.00)
- Government > Regional Government (0.71)
- Information Technology > Services (0.69)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.97)
Grok Is Generating Sexual Content Far More Graphic Than What's on X
A WIRED review of outputs hosted on Grok's official website shows it's being used to create violent sexual images and videos, as well as content that includes apparent minors. Elon Musk's Grok chatbot has drawn outrage and calls for investigation after being used to flood X with "undressed" images of women and sexualized images of what appear to be minors. However, that's not the only way people have been using the AI to generate sexualized images. Grok's website and app, which are separate from X, include sophisticated video generation that is not available on X and is being used to produce extremely graphic, sometimes violent, sexual imagery of adults that is vastly more explicit than images created by Grok on X. It may also have been used to create sexualized videos of apparent minors.
- North America > United States > California (0.14)
- Europe > United Kingdom > Wales (0.04)
- Europe > Slovakia (0.04)
- (2 more...)
- Media (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- (2 more...)
Elon Musk's Pornography Machine
On X, sexual harassment and perhaps even child abuse are the latest memes. Earlier this week, some people on X began replying to photos with a very specific kind of request. "Put her in a bikini," "take her dress off," "spread her legs," and so on, they commanded Grok, the platform's built-in chatbot. Again and again, the bot complied, using photos of real people (celebrities and noncelebrities, including some who appear to be young children) and putting them in bikinis, revealing underwear, or sexual poses. By one estimate, Grok generated one nonconsensual sexual image every minute in a roughly 24-hour stretch.
- Law (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.89)
- Health & Medicine > Therapeutic Area > Pediatrics/Neonatology (0.35)
Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Recent action recognition models have achieved impressive results by integrating objects, their locations and interactions. However, obtaining dense structured annotations for each frame is tedious and time-consuming, making these methods expensive to train and less scalable. At the same time, if a small set of annotated images is available, either within or outside the domain of interest, how could we leverage these for a video downstream task? We propose a learning framework StructureViT (SViT for short), which demonstrates how utilizing the structure of a small number of images only available during training can improve a video model. SViT relies on two key insights.
S4ND: Modeling Images and Videos as Multidimensional Signals with State Spaces
Visual data such as images and videos are typically modeled as discretizations of inherently continuous, multidimensional signals. Existing continuous-signal models attempt to exploit this fact by modeling the underlying signals of visual (e.g., image) data directly. However, these models have not yet been able to achieve competitive performance on practical vision tasks such as large-scale image and video classification. Building on a recent line of work on deep state space models (SSMs), we propose S4ND, a new multidimensional SSM layer that extends the continuous-signal modeling ability of SSMs to multidimensional data including images and videos. We show that S4ND can model large-scale visual data in 1D, 2D, and 3D as continuous multidimensional signals and achieves strong performance by simply swapping Conv2D and self-attention layers with S4ND layers in existing state-of-the-art models.
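The deep state space models this abstract builds on all start from the same discrete recurrence. A minimal pure-Python sketch of that building block, with made-up scalar parameters (illustrative only; S4ND itself learns structured, multidimensional versions of these matrices):

```python
# Minimal discrete state space model (SSM) recurrence:
#   x_k = a * x_{k-1} + b * u_k    (state update)
#   y_k = c * x_k                  (readout)
# Scalar a, b, c are hypothetical stand-ins for the learned SSM parameters.
def ssm_scan(u, a=0.9, b=1.0, c=1.0):
    x, ys = 0.0, []
    for u_k in u:
        x = a * x + b * u_k  # fold the input into the hidden state
        ys.append(c * x)     # emit the observed output
    return ys

# An impulse input reveals the layer's impulse response: a geometric
# decay governed by a, which is what lets an SSM model a continuous
# signal at any sampling rate.
print(ssm_scan([1.0, 0.0, 0.0]))
```

A multidimensional layer like the one described would apply such kernels independently along each axis (height, width, time) rather than along a single sequence.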
Composing Concepts from Images and Videos via Concept-prompt Binding
Kong, Xianghao, Zhang, Zeyu, Guo, Yuwei, Zhao, Zhuoran, Zhang, Songchun, Rao, Anyi
Visual concept composition aims to integrate different elements from images and videos into a single, coherent visual output, yet existing methods still fall short in accurately extracting complex concepts from visual inputs and flexibly combining concepts from both images and videos. We introduce Bind & Compose, a one-shot method that enables flexible visual concept composition by binding visual concepts with corresponding prompt tokens and composing the target prompt with bound tokens from various sources. It adopts a hierarchical binder structure for cross-attention conditioning in Diffusion Transformers to encode visual concepts into corresponding prompt tokens for accurate decomposition of complex visual concepts. To improve concept-token binding accuracy, we design a Diversify-and-Absorb Mechanism that uses an extra absorbent token to eliminate the impact of concept-irrelevant details when training with diversified prompts. To enhance the compatibility between image and video concepts, we present a Temporal Disentanglement Strategy that decouples the training process of video concepts into two stages with a dual-branch binder structure for temporal modeling. Evaluations demonstrate that our method achieves superior concept consistency, prompt fidelity, and motion quality over existing approaches, opening up new possibilities for visual creativity.
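The cross-attention conditioning the binder hooks into is the standard Diffusion Transformer form (shown here as the generic formulation, not anything specific to Bind & Compose):

```latex
\mathrm{Attn}(Q, K, V) = \mathrm{softmax}\!\left(\frac{Q K^{\top}}{\sqrt{d_k}}\right) V
```

where $Q$ is projected from the latent image/video tokens and $K$, $V$ from the prompt tokens, so a concept bound into a prompt token steers generation through the $K$, $V$ pathway.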
- Europe > Switzerland (0.05)
- North America > United States (0.04)
- Asia (0.04)
How to glimpse a pre-AI internet
Slop Evader isn't meant as a solution, but it gives a temporary reprieve. A sizable portion of the internet has devolved into an AI-contaminated wasteland. While an easy solution remains elusive, a browser extension called Slop Evader offers a glimpse at what the internet used to be only a few short years ago. While always prone to innumerable hazards, the online ecosystem is degrading largely due to the misuse of generative artificial intelligence content.
- North America > United States > California (0.05)
- Asia > Middle East > UAE > Dubai Emirate > Dubai (0.05)
- Asia > Japan (0.05)
RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
Mei, Haiyang, Huang, Qiming, Ci, Hai, Shou, Mike Zheng
Accurate robot segmentation is a fundamental capability for robotic perception. It enables precise visual servoing for VLA systems, scalable robot-centric data augmentation, accurate real-to-sim transfer, and reliable safety monitoring in dynamic human-robot environments. Despite the strong capabilities of modern segmentation models, it remains surprisingly challenging to segment robots, owing to robot embodiment diversity, appearance ambiguity, structural complexity, and rapid shape changes. Embracing these challenges, we introduce RobotSeg, a foundation model for robot segmentation in image and video. RobotSeg is built upon the versatile SAM 2 foundation model but addresses its three limitations for robot segmentation, namely the lack of adaptation to articulated robots, reliance on manual prompts, and the need for per-frame training mask annotations, by introducing a structure-enhanced memory associator, a robot prompt generator, and a label-efficient training strategy. These innovations collectively enable a structure-aware, automatic, and label-efficient solution. We further construct the video robot segmentation (VRS) dataset comprising over 2.8k videos (138k frames) with diverse robot embodiments and environments. Extensive experiments demonstrate that RobotSeg achieves state-of-the-art performance on both images and videos, establishing a strong foundation for future advances in robot perception.
- Asia > Singapore (0.40)
- Asia > Middle East > Jordan (0.04)
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)